Engineering an Efficient Reachability Algorithm for Directed Graphs
نویسندگان
چکیده
I declare that I have developed and written the enclosed thesis completely by myself, and have not used sources or means without declaration in the text. A reachability query on a directed graph G asks if there exists a path from a node s to a node t. Answering such queries on large graph like datasets has become an issue in various fields of research and real world applications over the past 20 years. Therefore, answering reachbility queries fast and efficiently has become more and more relevant. Nowadays, most XML [2] files make extensive use of ID [11] and IDREF, which transforms their tree based layout into a more complex directed graph. Therefore, querying for reachability requires a reachability query on a directed graph instead of a simple ancestor query. Similarly, the RDF model [20] relies on directed graphs. Various queries on RDF graphs involve reachability, for example to infer the relationship between objects. Since the semantic web builds on RDF, this topic gains more attention as the semantic web becomes more popular. Network biology uses reachability queries to query for protein-protein interaction on databases like the DIP [33]. Furthermore, reachability plays a role in querying for metabolic pathways on metabolic networks [19] or interaction on gene regulatory networks [3]. Additionally , in the field of model checking [9] reachability queries are needed to check whether a state can reach another state. Similarly, source code analysis uses reachability queries for pointer and dataflow analysis [26, 25]. Since the answer of a reachability query on a directed graph, which contains a cycle covering all nodes is always true, we can reduce a directed graph to its condensation. This is done by calculating the strongly connected components and contracting them into a single node. The condensation is a directed acyclic graph (DAG) which is much smaller in most cases. The two naive algorithms to answer reachability queries for a DAG are either traversing the graph using a Depth-First Search (DFS) or a Breadth-First Search (BFS) or calculating the transitive closure of the DAG. Simply traversing the DAG results in a query time in O(m) whereas storing the transitive closure needs O(n 2) space and has a complexity of O(nm) to compute but can answer a query in O(1). Throughout the past years, numerous different approaches for graph reachability emerged. They often combine the previously mentioned two methods for reachability querying. Thus trading …
منابع مشابه
n)-Space and Polymomial-time Algorithm for the Planar Directed Graph Reachability Problem
Throughout this paper we will use n to denote the number of vertices of an input graph, which is the unique input size parameter. For a directed graph G = (V,E), its underlying graph is the undirected graph ‘G = (V, ‘E), where the vertex pair {u, v} belongs to ‘E if and only if at least one of (u, v) or (v, u) belongs to E. The planar directed graph reachability problem is a special case of the...
متن کاملNew Time-Space Upperbounds for Directed Reachability in High-genus and H-minor-free Graphs
We obtain the following new simultaneous time-space upper bounds for the directed reachability problem. (1) A polynomial-time, Õ(ng)-space algorithm for directed graphs embedded on orientable surfaces of genus g. (2) A polynomial-time, Õ(n)-space algorithm for all H-minor-free graphs given the tree decomposition, and (3) for K3,3-free and K5-free graphs, a polynomial-time, O(n )-space algorithm...
متن کاملEfficient Parallel Algorithms for Planar DAGs
We show that testing reachability in a planar DAG can be performed in parallel in 0(log n log* n) time(0(logn) time using randomization) using 0(n) processors. In general we give a paradigm for reducing a planar DAG to a constant size and then expanding it back. This paradigm is developed from a property of planar directed graphs we refer to as the Poincare index formula. Using this new paradig...
متن کاملReachability in K 3 , 3 - free and K 5 - free Graphs is in Unambiguous
We show that the reachability problem for directed graphs that are either K3,3-free or K5-free is in unambiguous log-space, UL ∩ coUL. This significantly extends the result of Bourke, Tewari, and Vinodchandran that the reachability problem for directed planar graphs is in UL ∩ coUL. Our algorithm decomposes the graphs into biconnected and triconnected components. This gives a tree structure on ...
متن کاملIndexing Public-Private Graphs
We consider the reachability indexing problem for privatepublic directed graphs. In these graphs nodes come in three flavors: public—nodes visible to all users, private—nodes visible to a specific set of users, and protected—nodes visible to any user who can see at least one of the node’s parents. We are interested in computing the set of nodes visible to a specific user online. There are two o...
متن کامل